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l/^ ■ Abstract 

Expressions for all order derivatives for the permanent of a matrix 
. and its sjnnmetric tensor powers are given. Norms of the derivatives of 

' symmetric tensor powers are evaluated exactly. From there, an upper 

pLn , bound for the norm of the derivatives of the permanent is obtained. 

r~| ' These are then used to give perturbation bounds for both these func- 

"ti , tions. 
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1 Introduction 



The Jacobi formula for the derivative of the determinant of a matrix has 
been well known for a long time. For the antisymmetric and symmetric ten- 
, sors powers, norms of their first derivatives had been calculated in [7] and 

^ I Us) respectively. Recently, Bhatia and Jain [8] have derived three formulas 

^ ' for higher order derivatives of the determinant each of which is a generali- 

sation of the Jacobi formula. After that, Jain [10] has also calculated higher 
order derivatives for the antisymmetric tensor powers. In the same spirit, 
here we obtain derivatives of all orders for the symmetric tensor powers. 
To obtain these, we first calculate the derivatives of the permanent func- 
! tion. We also evaluate the norm of the derivatives of the symmetric tensor 

powers and give a bound on the norm of the derivatives of the permanent 
function. As corollaries, we derive higher order perturbation bounds for 
these functions. 

Let (f : M(n) — > C be a differentiable map. The derivative of 99 at a n x n 
complex matrix A is the linear map D(p{A) : M(n) — > C whose action at 
X G M(n) is given by :- 



D^(A)(X) = I 



if{A + tX). (1.1) 

i=0 
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Then Dip is a linear map of M(n) into ^(M(n);C), the space of all lin- 
ear operators from M(n) into C. The second derivative of ip at A is the 
derivative of at A and is denoted by D'^ip{A). This is an element of 
^(M(n); (^(M(n);C)) which is identified with ^2(M(n);C), the space of 
bilinear mappings of M(n) x M(n) into C. Similarly, for any k, the A;*^ 
derivative of ip at A, denoted by D''p{A), is an element of ^fe(M(n); C), the 
space of multilinear mappings of M(n) x • • • x M(n) into C (See f9l). If (/? is 
fc-times differentiable, then ior , . . . , X'' G M(n), 

dV(^)(x\...,x'=) ^ 



dti--- dtk 



ipiA + hX^ + --- + tkX''). 

■■■=tfe=0 

(1.2) 

To state our results concisely, we need some multiindex notations which we 
briefly recall from m and 111. Let 

Qm,n = {{h, ■ ■ ■,im)\l < k < ' ' ' < im < n} 

and 

Gm,n = {(n, • • • ,^m)|l < k < ' ' ' < hn < n} . 

For m > n, Qm,n = 4>> by convention. Also, note here that for m < n, Qm,n 
is a subset of Gm,n- Given two elements I and J of Gm,n, let A[I\J] denote 
the mxm matrix whose (r, s)-entry is the (v, js)-entry of A. lfl,J^ Qm,n, 
then A[l\ J] is a submatrix of A. 

Let 7i be an n-dimensional complex Hilbert space. Let ^'^'H be the /c-fold 
tensor power of Ti and y'^'H be the symmetric tensor power of H. If {cj} is 
an orthonormal basis ofV., then for a = (ai, . . . ,ak) G we define = 
fiai V • • • V eoj.. If a consists of i distinct indices ai, . . . ,ae with multiplicities 
mi, . . . ,me respectively, put m{a) = mil ■ ■ ■ m^!. Note that if a e Qk,n, then 
m{a) = 1. An orthonormal basis of v'^Ti is {m(a)^^/^ ea : a e G^^n}- Let 
V*^^ denote the kth symmetric tensor poM^er of A. With respect to the above 
mentioned basis, the (a, /3)-entry of V^A is (m(a)m(/3))~-^/^ per A[a|/?] . 

If X, J" G Qm,n, then we denote by A{X\J), the (n — m) x (n — m) 
submatrix obtained from A by deleting rows X and columns ^. The j*^ 
column of a matrix X is denoted by X^^y Given n x n matrices X^,. . . 
and J = (ji, . . . , jm) G we denote by A{J; X\ . . . , X""), the matrix 

obtained from A by replacing the j*^ column of A by the j*'^ column of X'^ 
for 1 < p < m, and keeping the rest of the columns unchanged, that is, if 
Z = A{J] X^, . . . , X"'), then Zy^j = Xg j for 1 < p < m, and Z^p^ = Aj^j if 
^ does not occur in J. Let a be a permutation on m symbols, then by Y^jy 

we mean the matrix in which Y,"'- , = X'^^'^'' for 1 < p < m and Y,% = if £ 

bv\ bp] — f — [£j 

does not occur in J. 
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2 Permanent 



The permanent of A, written as per (A), or simply per A, is defined by 

per^ = Xl"Mi)"2a(2) •••a„a(n), (2-1) 

where the summation extends over all the permutations of 1,2,. . . ,n. Let 
per : M(n) — > C be the map taking an n x n matrix to its permanent. This is 
a differentiable map. We denote by D per A, the derivative of per at A. 
The famous Jacobi formula for determinant says that 

Ddet {A){X) = tr (adj {A)X), (2.2) 

where the symbol adj (A) stands for the transpose of the matrix whose {i, j)- 
entry is (—1)*+-' det A{i\j), called the adjugate (or the classical adjoint) of A. 
The permanental adjoint of A, denoted by padj (A), is the n x n matrix whose 
(i,j)-entry is peTA{i\j) (See lfT2ll . page 237). This difference of transpose 
in the definitions of adj and padj is just a matter of convention. We obtain 
the following result similar to the Jacobi formula for determinant. 

Theorem 2.1. For each X G M(n), 

D per {A){X) = tr (padj {AfX). (2.3) 

Proof. For 1 < i < n, let A{j; X) be the matrix obtained from A by replacing 
the j^^ column of A by the j*^ column of X and keeping the rest of the 
columns unchanged. Then (12. 3D can be restated as 

n 

Dper(A)(X) =^per^(i;X). (2.4) 

i=i 

From (II. ID . we note that D per A{X) is the coefficient of t in the polyno- 
mial per {A + tX). Using the fact that per is a linear function of each of the 
columns, we immediately obtain (I2.4D . 



The Laplace expansion theorem for permanents ( lfT3ll . page 16) says that 
for any 1 < m < n and for any X G Qm,n, 

per^= VeiA[I\J]peTA{I\J). (2.5) 

In particular, for any i,l < i < n, 

n 

per^ = ^ aij per (2.6) 
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Using this, equation (12 ■4D can be rewritten as 

n n 

Dper(yl)(X) = ^5]xi,perA(i|j). (2.7) 

i=l 3=1 

The following two theorems are analogues of theorems 1 and 2 of ||8l] 
and also generalisations of equations (I2.4D and (12 .70 respectively. The 
proofs are also similar to the proofs in JS]! . 

Theorem 2.2. For 1 < m < n, 

D"'pei{A){X\...,X"')= J2 per^(J^;X'^«,X'^(2)^___^;sca(m))_ 

m.n 

(2.8) 

In particular, 

B"'pev{A){X,...,X) = m\ J2 per A{J; X, . . . , X). 

Proof. From (O), D™ per . . . , X"^) is the coefficient of ti • • • in 

the expansion of per {A + tiX^ + • ■ • + tmX"^). Using this and linearity of 
the per function in each of the columns, we obtain 02. 8D . ■ 

Theorem 2.3. For 1 < m < n, 

D™per(.4)(X\...,X™) = ^ ^ per J) per y^^^] [X|J]. (2.9) 

In particular, 

D™per(A)(X,...,X) = m! ^ j>ei A{1\J) per X[I\J]. 

Proof. For each J" G Qm.n, the Laplace expansion theorem gives 

per^(J;X'^«,...,X<^('-)) = ^ per^(X|J) peryf;^][X|J]. 

Equation ( I2.9D is obtained by expanding each term in the summation of 
(12. BP in this manner. ■ 

We note here that 

per {A){X, ...,X) = n\ per X, (2.10) 

and 

D"'per(yl)(X,... ,X) = V m > n. (2.11) 
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For X^,... , X'^ G ^(Ji), consider the following operator on ®"^'H: 

± y ^X'"(2) ^ ... (-2.12) 



It leaves the space V"*?^ invariant. We use the notation X W V • • • V 
for the restriction of this operator to the subspace y^H. 

For m X m matrices r\ . . . , T™, denote by Ap(r\ . . . , T"") the quantity 

^ per 



rpa(l) rpCj{m) 



This quantity occurs in [T] and has been termed as the mixed permanent of 
r\...,r™. When allT-' =T, 

Ap(r,...,r) = perT. 

We make the following observations. 

1. For n X n matrices X^, . . . , X™ and Z,J^ Gm,n, the(I, J")-entry of 
X^ V • • • V X™ is 

{m{I)m{J))-'/^Ap{X'[I\J], X"[X|J]). (2.13) 

Proof. The (X, J) -entry of X^ V • • • V X™ is given by 

[m{I)-'/\e,, V • • • V e.^), (XW • • • V X-)(m( J)-V2(e^.^ v • • • V e,J)) 



m(I)m(J))-i/2 



E 



lJ,T,riGSm P=l 



Ji7(l) ^ ^ Jrj{m) / 



(m!)^ 
m(Z)m(J))-i/2 



(771! 



1^2 



m 

E n (e.„X-''-^°^('')e,^(,,)(putting7 = r?oa-i) 



m(X)m(J))-V2 



■J7{9) 



U],^&S,n 9=1 

by putting i:^ = r o and observing that as r and rj vary in Sm, varies 
in Sm ml times.) 

m(X)m(J))-i/2 



ml 



E n(^9'-?'7(9))-^ntryof 

W,7gSm 9=1 

[(X-W[Z|:7])[i],...,(X-M[Z|J])h] 
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Til . 

= (m(X)m(J))-V2Ap(Xi[X|^], . . .,X^[I\J]). 



2. ForX,jrGQ^., 



E peryf^][Z|J] = m! Ap(XnX|J], . . . (2.14) 

Proof. 



E Per>[^][2:|J] 

treSm 

m 

E E n(V'J7{p))-entryofy[^] 

m 

E E n(V>J7(p))-entryofX-^(f) 

m 



m 

= E E n(V>J7(p))-entryof[(X-W[X|J])[i],...,(X-M[X|j7])[„]] 
= E per[(X'^«[X|J])[i],...,(X'^M[Z|J])[^]] 
= m! Aj,(xMX|J],...,X"^[X|J]). 



In view of these observations, it is easy to derive two reformulations of 
theorem 2.3 given below as theorems 2.4 and 2.5. 

Theorem 2.4. For 1 <m <n, 

B"'pei{A){X\...,X"')=m\ E pei A{I\J) Ap{X^[I\J], . . . ,X^[I\J]). 

(2.15) 

Let Pk be the canonical projection of y^H onto the subspace gener- 
ated by {cq : a G Qk,n}- If we vary a,P in Qk,n, we get the submatrix 
Pfe(v'^A)Pfe of v'^^. The matrix padj(^)-^ can be identified with a submatrix 
of an operator on the space V"~^?^. We call this operator \/"'~^A. Then 
padj(^)-^ = Pn-i{\/"^~^ A)Pn-i, which is an n x n matrix. It is unitarily sim- 
ilar to the transpose of Pn-i(V"~^^)-Pn-i- Similarly, consider the transpose 
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of the matrix whose (a, /3)-entry is per ^(a|/3). It can be identified as a sub- 
matrix of an operator on the space V^'^'Ti. We call this operator v"^^yl. 
The Q X Q) submatrix P„_fe(v""^A)P„_A: of v"'^''A is unitarily similar to 
the transpose of P„_fc(V"~'^^)P„_fc, which is a submatrix of V"~'^^. 
Then equation (I2.3D can also be written as 

Dper{A){X) = tr (P„_i(v"-'^)P„_i)X. (2.16) 

Its generalisation for higher order derivatives can be given as follows: 

Theorem 2.5. For 1 <m < n. 



B'"peT{A){X\...,X' 



ml tr 



P (y^ 

^ n~m\ V 



V 



V X'")P„ 



.(2.17) 



/n particular, 

D'"per(^)(X,...,X) 



m! tr 



(P^(V'"X)P„ 



Proof. For X, G Qm,n, the (X, J^)-entry of Pm{X^ V • • • V X'^)P^ is 
Ap(Xi[X|J],...,X-[I|J]). Also, (J,X)-entryof P„„fc(v"-'=A)P„_fe is 
peiA{X\J), by definition of y'^'^A. Equation (I2.17D is now clear from 
(IZT5D . ■ 



3 Symmetric tensor power 

Consider the map : M(n) M(("+^"^)) which takes an n x n matrix 
A to its /cth symmetric tensor power. For elements 7' = (71, •••,7^) £ 
Gm,n and a = (ai, . . . , at) € Gfc,nj we write ^'<^aiil<'m<k<n 
and {7[, . . . , 7^} C {a^, . . . , a^} with multiplicities allowed such that if 
occurs in a, say, da times then cannot occur in 7' for more than da times. 
Also if 7' C a, then a — 7' will denote the element (71, ... , jk-m) of Gk-m,n 
where 7^ G {ai, . . . , a^} such that 7^ is some and occurs in a — 7' exactly 
da — dy times where da and d^/ denote the multiplicities of in a and in 
7' respectively. 

Let y be a ("+^-1) x matrix and for 1 < m < /fc < n, let 

7,5 G We denote by y('=)(7, (5), the ("+^~^) x ("+^"^) matrix whose 

indexing set is Gk^n and for a,/3 G Gk^n, the (a,/3)-entry of y('^)(7, 5) is 

( "'tT2M%"'0 (« - 7, /? - '5)-entry of y if 7 C « and <5 C /? and 

zero otherwise. 

Theorem 3.1. Let A G M(n). Then for 1 < m < k < n, 

D'"V^(A)(X\...,X") =m! ^ perA[7|(^] (X^ V--- VX'^)(*^)(7,5) 

(3.1) 
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Proof. We show that for a,(3 ^ Gk,n, the (a, /3)-entry on both the sides is 
equal. First a lemma: 

Lemma 3.2. If f and g are two maps such that the composite f o g is well 
defined and g is linear, then 

D-(/ o g){A){X\. . . , = D^f{g{A)){g{X^), g{X^)). (3.2) 

Proof. This follows from (fOl). ■ 

Now, for a,^ G Gk,ni, the (a,/3)-entryof v''(A) is (m(a)m(/3))~^/2 per^[a|/3]. 
In the lemma above, taking / to be the per map and g : M(n) — ^ M(A;) as 
the linear map which takes A to 74[a|/3], it follows that the (a, /3)-entry of 
D™ V'^ (.4)(Xi,...,X™) is 

(m(a)m(/3))"i/2 j^m ^^^{^A[a\P]){X^ . . . , X"^[a|/3]). 
So by (IZT5D . the (a, /3)-entry of D"* v'^ {A)(X^ X™) is 

m! (m(a)m(/3))-V2 ^ per(A[a|/3])(X|J) Ap((Xi[a|/3])[X|:7], . . . , (X-[a|/3])[X|:7]) 

(by dUl) 

= m! (m(a)m(/3))-i/2 ^ per^[a - yi/3 - 6'] Aj,{X'W\5'], . . .,X^W\5']). 

-y'Ca, (5'C/3 

= m! (m(a)m(/3))-^/2 ^ per A[-f\6] Ap{X^ [a — 7|/3 — 5], ... , X"^[a — ^\f3 — 6]). 

■yCa, 5Ci3 

(3.3) 

The last expression is equal to m! 2^ per A[7|()J I — — — — I times the 

7C0, 5C0 

(a - 7, /3 - (5)-entry of (xW • • • V X'' 



which is same as ml 

J2 (per A[7|5]) times the (a, /3)-entry of (XW • • • V X™)('=)(7, 6). 

This is the {a, /3) -entry of 

m! ^ per^[7|<5] (X^ V--- VX'")W(7,5). 



Theorem 3.3. Let A G M(n). Then for 1 <m<k <n, 

kl 

{k — m)\ 



ID™ v'' A\\ = ^' IIAII^"'". 
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Proof. We use ideas similar to those in [5] and Q . We first show that the 
left hand side of the above equation is smaller than the right hand side. For 
this, we need a lemma. 

Lemma 3.4. For 1 < m < k < n, 

0-e5m jr>0 

jlA \-jm + i=k-m 

(3.4) 

From this it follows that 

h\ 

||D'"®'=^|| = — Mf-™. (3.5) 

[k — m)\ 

Proof. Equation (I3.4D can be obtained easily since its left hand side is the 
coefficient of ti • • • t„i in the expansion of ®^{A + tiX^ + • • • + tmX'"^) . For 
the norm, the < inequality follows by the triangle inequality. The equality 
is attained at the tuple (yl/||yl||, . . . ,^/||A||). ■ 

Let Sk ■ ^'^T-L — > y^T-L be the projection given by 

Sk{xi • • • Xfc) = ^ (g) • • • 2;^(fc) 

and let Qk : y'^U ®^1-L be the inclusion map. Define Sk ■ .^{(^^Ti) 
.^{y^-H) by 

Sk{T) = SkTQk. 

Then Sk is a projection and so \\Sk\\ < 1. Also v''' : if ("H) if (v'^'H) factors 
through 0^ : if (?^) ^ ^{(^^U) via Sk as follows. 

y^A = Sk{®^A) yAe^in). 

Since Sk is linear, therefore 

D™ V^' A = Sk oD"'®'' A. 

So, 

||D"^V*^ All < ||5fc||||D"O^A|| < — -^l—pf-'^ (3.6) 

[k — m)'. 

Now, we show that for = • • • = X™ = /, HD™ v'^ (A) (/,..., /)|| > 
(k-m)\ ll^ll^"'"- Let A'^ = UAW be the singular value decomposition of A 
where U and W are nxn unitary matrices. Then y^U and y'^W are unitary 
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matrices of order ("^l'^). For X\ . . . , X"" G M(n), 
= ||(V'=?7)(D™ V'^ {A){X^,...,X'^)){y^W)\\ 

Qva 



Qr. 



dti ■ ■ ■ dtr, 



dti--- dtm 

Qva 



y^ {A + txX^ + • ■ ■ + t^X"^) {\J^W) 

ii=-=im=0 / 



tl— tm— 



tl— im— 



{A^ + ti^/X^T^ + • • • + tmUX'^W) 



dti ■ ■ ■ dtr, 

ID'" v'^ iA^){UX^W,...,UX"'W)\\. 



Since = ||X'|| and {UXW : ||X|| = 1} includes all the matrices of 

norm 1, therefore 



||D™ V'^^ll 



sup IID™ V'=(A)(X\...,X"^)|| 

||Xi|| = ...=||X-||=l 

sup \\B"'\/'' {A^){UX^W,...,UX"'W)\\ 

||Xi|| = ...=||X-|i=l 

||D™ ^^||. 
/ siiA) 



\ 



Now, let A'^ 
From (fOD . 

V'^ (At) (/,..., I) 



, where si{A) > ■ ■ ■ > Sn{A) > 0. 



SniA) 



dti ■ ■ ■ dtr, 



fl=- = fm=0 



v'' {A^ + til+--- + t^l). 



The matrix \/''{A'^ + til + • • • + tml) is also a diagonal matrix. Its top left 
entry is (a, a)-entry for a = (1, ... , 1), which is 

m{a)~^ per((y4t + tj + h tm/)[ala]). 

This is equal to (A;!)^^ times permanent of the k x k matrix each of whose 
entries are si + ti + ■ ■ ■ + tm- So the top left entry is (si + ti + • • • + tm)^- 
Now 



Qm 
dti ■ ■ ■ dtr 

So 



-{si+ti + - ■ ■ + trri)^ = k{k-l) ■ ■ ■ {k-m+l){si+ti + - ■ ■ + tr 



\k—m 



dti ■ ■ ■ dtr 



(Si + ti + --- +tr 



k\ 



k—m 



il=---=tm=0 



(A; — m)\ 



I "1 
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Thus, 

If] 

||D- V'^ {A){I, . . . , I) II = ||D- (.4t)(/, ...,/) II > • J Af-"^. (3.7) 

Equations (I3.6D and (I3.7D give the desired result. ■ 
Corollary 3.5. For n x n matrices A and X, 

f A I \^ \ \ / ^ ^ 1 1 /II /ill I II 1 1 \ ^ II ^11^ 



II V'^ {A + X)-y''A\\ < (p|| + ||X||)* 
Proof. By Taylor's theorem, 

" 1 

II V^'(A + X)-V'=^|| = liy — D'"V'=(^)(X,...,X)|| 



n ^ 



fe=l 



< E^iiD"^v'=^iiiixr 

m! 

m=l 

m! (fe — ml! 

m=l ^ ' 



E (^)pf-'"iix|| 

m=l \ / 



= (P|| + ||X||)'=-pf. 

■ 

Corollary 3.6. For I < m < n, 

71 ' 

||D-peryl|| <^ ^Pf-'" (3.8) 

(n — mj! 

and 

I per(^ + X)- per A| < (P|| + ||X||)- - Pf . (3.9) 
Proof. Since per A is (a, a)-entry of V"^ for a = (1, ... , n), we get 

||D^'"peryl|| < IID" V".4|| = -— " '^'"^""^ 

(n — mj! 

and again by Taylor's theorem, 

|per(^ + X)-perA|<(P|| + ||X||r 



Consider the simplest commutative case: A = I, X = xl. Then the 
expression on both the sides of the inequality ( I3.9D is 



n 



E 



n 



k=l 

So no improvement on the perturbation bound of per is possible in this 



sense. 
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Remarks 

1. Since per ^ is the (a, a)-entry of V"yl for a = (1,2, .. . D™per(yl)(X^ 
is the (a, a)-entry of D™ V" {A){X'^ X™) which by equation (l331) 
equals 

ml J2 per A[7l^] ApiX'[a - ^\(3 -6],.. .,X"'[a - 7|/3 - 5]) 

7,5eG„ 



7,(5Ca 



m! 



^ per A(7'|5') Ap(Xi [7' . . . , X-[7'|5']) 



= E per^(7V)pery[^,][7>'] (by (HH) 

m 'y' ,0' GQm,n 

which is same as equation ( I2.9D . 

2. The bound on ||D'"per A|| can also be obtained directly from (I2.17D . 
First note that if A is an n x n matrix, then 



<PI|i<np||. (4.1) 
Also, if si{A) > ■ ■ ■ > Sn{A) are the singular values of A, then 

Wv'' A\\ = siiA)'' = \\Af (4.2) 

and if IIX-'II = 1 for all j then \\PmiX'^ V ••• VX™)P™|| < 1. 
The trace norm \\ ||i is the dual of the operator norm and so 

1 = sup \trAX\. (4.3) 

11^11=1 



Now, 



|D"^per^||= sup ||D™ per . . . , X'") || 

||Xi|| = ---=||X™||=l 

= m! sup I tr [(P„_m(v"""'A)P„_m) 

||Xi||=--- = ||X'"||=l 

iP^{x' y ■ ■ ■ V x"')Pm)]\ 

<m!||P„_„(v"-™.4)P„_^||i (by(E3D) 

< m! ( " ) \\Pn-m (V"~™^) Pn-m\\ (usiug ^) 

\rn I 

< mlf 111 V^-'^^ll 



m , 



(n — m)! 



(by (El) 
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